Get next substring.
Receive search query from user.
Generate list of relevant documents based on the query.



203 



Select subset of relevant 
documents as first k 
documents. 
Generate first substring, s, from user query.
Find fraction of documents in subset of documents that contain s (FRAC[s]).
processed?

Yes



208 



Select longest substrings in which FRAC[s] > a predetermined threshold f.
If two or more substrings overlap, select substring with the higher FRAC value.
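
The flowchart steps above can be sketched in Python as follows. This is a minimal illustration, not the method as claimed: the function names (`select_units`, `frac`, `contains`), the word-level notion of "substring", the whole-word matching rule, and the greedy way overlaps are resolved are all assumptions made for the example.

```python
def substrings(query):
    """Yield (start, end, text) for every contiguous run of query words.

    Assumption: a "substring" is a run of whole words, not characters.
    """
    words = query.split()
    for i in range(len(words)):
        for j in range(i + 1, len(words) + 1):
            yield i, j, " ".join(words[i:j])


def contains(doc, s):
    """True when s occurs in doc as a run of whole words (assumed rule)."""
    return f" {s} " in f" {doc} "


def frac(s, top_docs):
    """FRAC[s]: fraction of the top-k documents that contain s."""
    if not top_docs:
        return 0.0
    return sum(1 for d in top_docs if contains(d, s)) / len(top_docs)


def select_units(query, ranked_docs, k, f):
    # Select the first k documents and normalize case and whitespace.
    top_docs = [" ".join(d.lower().split()) for d in ranked_docs[:k]]

    # Keep every substring whose FRAC exceeds the threshold f.
    candidates = []
    for i, j, s in substrings(query.lower()):
        score = frac(s, top_docs)
        if score > f:
            candidates.append((i, j, s, score))

    # "Longest substrings": drop any candidate contained in another one.
    maximal = [
        c for c in candidates
        if not any(o is not c and o[0] <= c[0] and c[1] <= o[1]
                   for o in candidates)
    ]

    # Overlaps: visit candidates by descending FRAC and keep a candidate
    # only if it does not overlap one already kept (assumed tie-breaking).
    chosen = []
    for c in sorted(maximal, key=lambda c: (-c[3], -(c[1] - c[0]), c[0])):
        if all(c[1] <= o[0] or o[1] <= c[0] for o in chosen):
            chosen.append(c)
    return [c[2] for c in sorted(chosen)]


docs = [
    "new york city guide",
    "hotel reservations in new york",
    "book hotel reservations online",
    "new york weather",
]
units = select_units("new york hotel reservations", docs, k=4, f=0.4)
```

With these four hypothetical documents, "new york" appears in three of the top four and "hotel reservations" in two, while no longer substring appears in any, so those two substrings are the ones selected.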
Fig. 3
Identify Q(s) as the set of documents, d, in top k that contain s.
Let: m = 1 / (k + log(RANK(d)))
Calculate N as:

(sum over each of the top 
k documents) 
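
A rank-weighted version of FRAC[s] might be sketched as below. Several points are assumptions rather than statements from the figure: that the weight has the form 1 / (k + log(RANK(d))) with a natural logarithm and ranks starting at 1, that N is the sum of that weight over the top k documents, and that the weighted fraction is the weight of the documents in Q(s) divided by N. The names `rank_weight` and `weighted_frac` are hypothetical.

```python
import math


def rank_weight(rank, k):
    """Weight of the document at 1-based position `rank` (assumed form)."""
    return 1.0 / (k + math.log(rank))


def weighted_frac(in_q, k):
    """Weighted share of the top-k documents that contain s.

    in_q[i] is True when the document at rank i + 1 belongs to Q(s).
    """
    weights = [rank_weight(r, k) for r in range(1, k + 1)]
    n = sum(weights)  # assumed N: sum over each of the top k documents
    return sum(w for w, hit in zip(weights, in_q) if hit) / n


high = weighted_frac([True, False, False], k=3)  # s only in the top result
low = weighted_frac([False, False, True], k=3)   # s only in the third result
```

Under these assumptions a match in a highly ranked document contributes more than a match further down the list, so `high` is larger than `low`.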




